Commutativity of Substitution and Variation in the Actions of 

Quantum Field Theory 

Zhong Chao Wij] 
Dept. of Physics, Zhejiang University of Technology, 
Hangzhou, Zhejiang 310032, China 

Abstract 

There exists a paradox in quantum field theory: substituting a field configuration which solves 
a subset of the field equations into the action and varying it is not necessarily equivalent to 
substituting that configuration into the remaining field equations. We take the S 4 and Freund- 
Rubin-like instantons as two examples to clarify the paradox. One must match the specialized 
configuration field variables with the corresponding boundary conditions by adding appropriate 
Legendre terms to the action. Some comments are made regarding exceptional degenerate cases. 

PACS numbers: ll.10.-z 04.60.-m 04.90.+e 04.50. +h 



Electronic address: zcwu2007@yahoo.com 



I. INTRODUCTION 



It is well known in quantum field theory that all field equations can be derived from 
an action principle. A field configuration satisfying the field equations is called a classical 
solution. 

By substituting into the action a field configuration which solves a subset of the field 
equations, the action becomes a functional of the remaining fields. This "reduced action" 
would describe the partial theory in the classical background of the substituted field con- 
figuration. One also expects that varying the reduced action would result in the same field 
equations for the remaining variables as those derived from the original complete action. 

However, the situation is not so simple. A paradox exists in the literature over the fact 
that substituting a field configuration which solves a subset of the field equations into the 
action and varying it is not necessarily equivalent to substituting the same field configuration 
into the remaining field equations [1]. 

A similar problem had already been encountered earlier in general relativistic cosmology, 
where imposition of symmetry on the action was found not necessarily to give the correct 
field equations [2], i.e., imposing symmetry does not necessarily commute with deriving the 
field equations from an action principle. The reason for the failure is that the symmetry 
imposition may interfere with the requirement of vanishing variations at the boundary and 
hence boundary terms cannot be assumed to vanish. 

This problem is very disturbing for quantum field theory. Scientists have never been 
lucky enough to have an ultimate theory with the first try. Instead, they usually consider 
the known regime as a classical background, which corresponds to the configuration in the 
substitution. If the above substitution leads to an incorrect reduced action for the remaining 
variables, then it is simply hopeless to make any progress based on the reduced action of the 
partial theory. A typical example of this is the consideration of quantum fields in curved 
spacetime, in which the metric is assumed to be a solution of the Einstein equations. In 
the mid-1970s, even though little was known about quantum gravity, some very important 
progress was made in quantum fields in curved spacetime, the discovery of the Hawking 
radiation in the black hole background being a prime example. 

In this paper we use two examples to show that this paradox can be clarified. These two 
examples are the S A instanton and the Freund-Rubin cosmological models. The paper can 
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be thought of as a generalization of a previous paper [3]. We shall discuss the problem of 
zero value of the cosmological constant in Section II. In Section III the 4-sphere model will 
be studied. Section IV will be devoted to the Freund-Rubin models. We summarize and 
conclude this paper in Section V. 

II. THE COSMOLOGICAL CONSTANT PROBLEM 

The most notable example of this confusion had been associated with the cosmological 
constant problem in quantum cosmology [4]. The issue has been dealt with in the separate 
publication due to its special importance [3]. However, it is instructive to give a brief 
review here. To show that the cosmological constant is probably zero, Hawking considered 
a cosmological model created from an S 4 seed instanton with the Euclidean action [4] 

where R is the scalar curvature of the space M of 4-metrics, A represents the contributions 
of "the bare cosmological constant" and the ground states of all matter fields. Here the 
contribution from a rank 3 antisymmetric tensor gauge field A vpa is singled out. It arises 
naturally in iV = 8 supergravity in four dimensions [5]. F is the field strength of A vpa . We 
use Planck units in which c — G — k — h — 1. 
The field configuration [1] [4] 

^gF^ pa = (A\y ll2 Ke^ p(T (2) 

or 

iW = (4!r 1/2 V^W* (3) 
with an arbitrary constant k solves the gauge field equation 

F^. c = 0. (4) 

Substituting this solution (2)-(3) into the action (1), one can see that the F 2 term behaves 
like an effective cosmological constant 

A e// = vr/t 2 , (5) 

and the total cosmological constant is A tota i = A + A e jj. For A totah the radius of S 4 is 
(3/Atotai) 1 / 2 , and the action is — 3n / A total . Here it is assumed that A tota i is positive. The 
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action is the negative of entropy of the created de Sitter spacetime. The relative creation 
probability of the universe is the exponential of the negative of the action [6] . It follows that 
the most probable configurations will be those with very small values of A total , and nature 
will automatically select the right value of k for this. Therefore, Hawking concluded [4]: 
"the cosmological constant is probably zero." 

However, Duff showed that after substituting the configuration into the Einstein equation, 
which is derived from the complete action (1), the Einstein equation reads 



Comparing the reduced action and substituted field equation (6), one finds the total 
cosmological constant appearing in the action is not the same as that appearing in the field 
equation [1]. What we truly observe is in (6)! 

This dilemma can be resolved by choosing the right representation for the wave function 
of the universe at the equator of the instanton, where the quantum transition from the Eu- 
clidean regime S 4 to the Lorentzian regime dS 4 occurs. The instanton should not be simply 
considered as S 4 , it must be considered as a union of a southern hemisphere joined to its time 
reversal, the northern hemisphere. The action (1) corresponds to the boundary condition 
that A vpa is given at the equator between the two hemispheres for the creation probability 
calculation. However, A vpc is the wrong representation, which suffers a discontinuity across 
the equator. Here, we have set A vpc to be regular in each hemisphere. Therefore one has to 
use the right representation, its conjugate variable ^/gF^ upa , which is continuous across the 
equator. The representation transformation is carried out by adding a Legendre term to the 
action (1) 



where T,$+n denotes the two equator boundaries for both the southern and northern hemi- 
spheres. 

The total action is 



(6) 





(7) 
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where the second equality is obtained by taking divergence of the Legendre term and using 
the gauge field equation (4). 

Substituting (2)-(3) into (8) yields 

Apparently, varying the action (9) with respect to the gravitational field will result in the 
same Einstein equation (6). Therefore, Duff's dilemma about the cosmological constant is 
dispelled and Hawking's argument is completely proven [3]. 

Two decades after the publication of Hawking's paper, many people believe that the 
cosmological constant is not zero. Therefore, Duff's paradox has remained long forgotten. 

III. THE 4-SPHERE MODEL 

However, the paradox as a whole has not been resolved. If one is dealing with the 
simple S* 4 instanton without any boundary, instead of the southern hemisphere joined to 
the northern hemisphere of S 4 in the above quantum creation scenario, then it seems that 
there is no reason to reject the representation A vpu . Even for the representation ^fgF^ vpcT it 
seems that no Legendre term has to be added since there seems to be no boundary in the 
S 4 instanton model. Therefore, the dilemma still persists. 

Now, let us study this simple S 4 model with the action (1). As one derives the Einstein 
and gauge field equations from the action, it is implicitly assumed that the metric g pu and 
the gauge potential A pup are independent variables. In varying the action, one imposes the 
condition that A^ vp is fixed at the boundary. 

There is no boundary for the metric of the S 4 model. People may wonder, why we should 
bother with the boundary value problem here. The point is that the gauge potential is 
involved. Since one has to use at least two patches to cover the manifold for the gauge 
potential, the Legendre term should arisen from the boundary between the patches under 
some circumstances. In fact, here we are not particularly interested in varying the action 
with respect to the gauge field, it has been done and the solution (2) was already found 
for (4). The key point is that the boundary term for the gauge field is also crucial for our 
main motivation, the next step: varying the reduced action with respect to the remaining 
variable, the gravitational field. 
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For convenience, the reduced action obtained from substituting (2)-(3) into (1) can be 
written as follows 



where k = g 1 ^ 2 K. Now one can vary the reduced action (10) with respect to the gravitational 
field under the condition that A^p is fixed. The condition is equivalent to F^^ being fixed, 
i.e, k is fixed using (3). Here, for simplicity, the minisuperspace ansatz is imposed. This 
results in the Einstein equation (6), as expected. 

One can equally vary the reduced action with respect to the gravitational field under 
the condition that the conjugate variable ^JgF^ U(XT is fixed. It is noted that the variables 
s fgF ,xvpc and F^ upa are not equivalent in the presence of a gravitational field. Here, we are 
using g pu and y /gF fl ' /pa as independent variables. For consistency, one has to change the 
boundary condition correspondingly, that is, add a proper Legendre term to the action. 

As emphasized earlier, for the gauge potential A pvp , it is impossible to cover the manifold 
S 4 by one patch. We can decompose the manifold into two parts: a small ball B and S 4 — B, 
which we denote as A. Following the above procedures, the Legendre term is 



One can use the gauge field equation (4) to reduce the Legendre surface term to a divergence 
term as in (8). Here, we set A V( „ to be regular in A and B, respectively. There is an 
unavoidable discontinuity of the potential across the boundary between them. It turns out 
that the contributions of the Legendre terms are — k 2 Vb/4 and — k 2 Va/4, where Vb and Va 
are their volumes. The total contribution of these two Legendre terms for the instanton is 



Now, we can let B shrink to a point, then the contributions of B to both the volume and 
boundary terms of the action tend to zero, and we can discard them. In any case, the total 
action for S 4 is (8)-(9). This is the same as that in the model for the cosmological constant 
problem. The Einstein equation can be derived from the reduced action in the same way. 

Indeed, a byproduct of the above argument is that we have proven that it is impossible to 
regularly express A pup for the whole S 4 in one piece or a single gauge. Otherwise, the Leg- 
endre term would vanish after shrinking, and this would lead to a contradiction. Therefore, 
even though S* 4 has no boundary, the inevitable singularity of A pup leads to the Legendre 
term. 




(10) 




(11) 



k 2 V S 4/A. 
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In the following, for simplicity, we use gauge freedom to force the singularity to be 
located at the center of B. This is also true for the S n (n > 2) model with a rank (n — 1) 
antisymmetric tensor Abelian gauge field. 

In summary, what we did is to cover S* 4 by the two patches: A and B. For the gauge 
potential, one can only cover it by at least two arbitrary patches. If we use the representation 
A^ up , we have to tolerate the discontinuities across the boundaries of these patches. If we 
prefer using the representation ^fgF pvpCT to avoid these discontinuities, then we have to add 
the Legendre terms, whose total contribution takes the same form as (11), where dA + dB 
is replaced by all these boundaries. The total action remains the same as (8). Our choice 
is just for simplicity. There is no topology change when one lets B shrink to a point. The 
same arguments apply to the next model. However, for the wave function of the universe 
only the representation ^/gF pupa can be used. 



IV. THE FREUND-RUBIN MODELS 

Next turn to a more complicated model, the Freund- Rubin-like instanton [7]. This model 
has been used to investigate dimensionality of spacetime in quantum cosmology [8]. The 
spacetime manifold M is described by a product of two spheres S s x S n ~ s . Apparently, the 
S* 4 model is a special case with n = s = 4. The seed for quantum creation of a black hole 
or a codimension-2 braneworld [9] is an instanton of topology S s x S 2 , where S 2 can be a 
distorted sphere with some conical singularities. It is trivial to generalize our discussion to 
this case. For simplicity, we shall only consider the Freund-Rubin-like instanton, and the 
action takes the form 

I = - [ dx n g 1 ' 2 (—(R-2A)- -F 2 ] -—I dx^h^K, (12) 
Jm V16tt 1 } 2s J 8irJ dM K J 

where R is the scalar curvature of M, K is the extrinsic curvature of the boundary dM 
with induced metric hij, A is the cosmo logical constant, and F is the field strength of a rank 
— s — 1 antisymmetric Abelian tensor A ai ... cts _ 1 . We shall use the ansatz that all indices of 
nonvanishing F components should reside in S s . The Euclidean action is obtained via an 
analytic continuation from the Lorentzian action. There is some overall sign ambiguity in 
the action. The ambiguity can be eliminated by the following consideration. In order for 
the primordial fluctuations to take the ground states allowed by the Heisenberg uncertainty 
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principle [10], the term associated with the scalar curvature of the external factor spacetime 
must be negative. But this ambiguity will not affect our discussion. 
Similarly, the gauge field equation is 

/••" ; o (i<«i<n) (13) 

and under the ansatz the solutions must take the form 

{(s\g)-^ 2 Ke ai - a ' (1< OLi < s) 
otherwise, 

where g is the metric determinant of the product space, and k is an arbitrary constant. It 
follows that 

J (s\g n _ s y 1/2 (g s ) 1/2 Ke ai ... as (1 
| otherwise, 

where g s and g n -s are the metric determinants of the factor spaces. 
The Einstein equation is 

G» v = 8n9^ - Ag^, (16) 

where the stress tensor is 

Qixv _ rp ft pai-a s -!V \_ rp rp^-as av fin\ 

u ~ r ai-o s -i r 2 S a i--- a s y ' V 11 J 

From the Einstein equation one can derive the scalar curvatures of the factor spaces 

(n - s - 1)8ttR 2 2sA 

R s = - ^ + — -, 18 

n — 2 n — 2 

( s -l)(n- S )87r/t 2 2(n-s)A 

R ns = 7 ^ 1 ~ , (19) 

s(n — 2) n — 2 

where R = (g n - s )~ 1 / 2 K, and it is noted that R depends on the metric of the factor space 
gn-s p rom these curvatures one can derive the radii of the factor spheres. 

Now we substitute the gauge field configuration (14)-(15) into the action (12), then the 
reduced action is 

/ = ~ Jjx n (igs) ll2 {9n-s) 1,2 {R - 2A) - y(<fc)- 1/2 (^-.) 1/2 A 2 ) ■ (20) 

where R = {gn-sY l/2 {9sf /2 ^- 
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One can easily derive the Einstein equation from (20) without a boundary term, consid- 
ering A ai ... as l or k as the given independent variable, as we did in the S A model, 

GT = -A<T + J^J- t (l < ^ y < s), (21) 
s 

AitK 2 a llu 

GT = -AgV- - ,(s + l<n,v<n), (22) 
s 

which is exactly the same as that derived from the complete action (12). Eqs. (18)-(19) are 
implied by (21)-(22), of course. 

One can also derive these from (20) with the Legendre term, considering ^fgF OL1 "' OLs as 
the independent variable. As usual, the corresponding Legendre term is 



/ dS ai A a2 ... as F ai "' as , (23) 

J(dS s )xS n - s 



1 Legendre 

J(dS s )xS n - s 

where S s denotes S s minus a point where the singularity of A ai ... asl is located, as in the 
S* 4 model. The integral is over (the A singularity in S s )xS n ~ s (in the sense of shrinking B 
to its center). 

Using the gauge field equation (13), the Legendre term can be written 

hegendre = ~ f d X n g 1 / 2 — , (24) 

JS s xS n - s s 



where 



The total action is 



F 2 = /•;,...„, = (gn-s)' 1 k 2 . (25) 



167T 

Using (25), one can recast (26) into the form 



total = -^-j M dxU 9 1/2 (R - 2A + Y^ 2 ) • (26) 



' total 



~ dx n [{g s ) 1 ' 2 {g n - s ) l/2 {R - 2A) + ^(^) 1/2 (^_ s )- 1/2 ^ 2 ) . (27) 

Varying the reduced total action (27) with respect to the metric for the given independent 
k, one can also derive the same Einstein equation (21)-(22). Nature is always self-consistent! 



V. CONCLUSION 



For a general case, if an original solution is extremal to the original action, then the 
solution obtained from the reduced action should be extremal too, as long as we supplement 
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the action with an appropriate boundary term. However, if the complete solution is non- 
extremal stationary to the original action, one has to be cautious, the solution that solves 
the subset of the equations might trace an orbit with constant action value, then the whole 
orbit may be the solution set for the reduced action and the original solution for the complete 
action would be one in the solution set after the substitution. We call this case degenerate. 
Since the Euclidean action of gravitational field is not positive definite, so one has to pay 
close attention to it. However, this issue does not affect our models due to our choice of 
variables. 

In summary, we have used two models to show that substituting the configuration into 
the action and then varying it is equivalent to substituting it into the field equation, except 
for the above degenerate case. The key point is to identify the variables substituted and 
provide the corresponding boundary conditions, which are implemented by adding to the 
action the proper Legendre terms needed. In general, the choice of independent variables is 
made based on convenience. The origin of the inconsistency in the earlier literature is the 
mismatch of the substituted fields and their boundary conditions. It is believed that as long 
as we keep this in mind, everything is consistent and the reduced action is viable for the 
partial theory. 
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